Amendments to the Claims:

This listing of claims will replace all prior versions, and 
listings, of claims in the application: 

Listing of Claims:

1. (currently amended) A method of controlling high-speed
reading in a text-to-speech conversion system including a text
analysis module for generating a phoneme and prosody character
string from an input text; a prosody generation module for
generating a synthesis parameter of at least a voice segment, a
phoneme duration, and a fundamental frequency for said phoneme and
prosody character string; a voice segment dictionary in which voice
segments as a source of voice are registered; and a speech
generation module for generating a synthetic waveform by waveform
superimposition by referring to said voice segment dictionary,

said method comprising the steps of:

providing said prosody generation module with a phoneme
duration determination unit that includes both a duration rule
table containing empirically found phoneme durations and a duration
prediction table containing phoneme durations predicted by
statistical analysis;

designating an utterance speed;

selecting one of the duration rule table and the duration
prediction table according to the utterance speed; and

determining a phoneme duration by using, when the utterance
speed exceeds a threshold contained in the duration rule table,
said duration rule table and, when said utterance speed does not
exceed the threshold, said duration prediction table.

2. (original) The method according to claim 1, wherein said
threshold is a predetermined maximum utterance speed.
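
For illustration only, the table selection recited in claims 1 and 2 might be sketched as follows. This is a minimal sketch and not part of the claims: the names `DurationTable` and `determine_phoneme_duration`, the millisecond unit, and the way the threshold is stored as a field of the rule table are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class DurationTable:
    """Hypothetical table: phoneme symbol -> duration in milliseconds (assumed unit)."""
    durations_ms: Dict[str, float]
    # Assumption: the rule table carries the threshold (claim 1 says the
    # threshold is "contained in the duration rule table").
    threshold: Optional[float] = None


def determine_phoneme_duration(phoneme: str,
                               utterance_speed: float,
                               rule_table: DurationTable,
                               prediction_table: DurationTable) -> float:
    """Pick a table by comparing the designated speed with the threshold."""
    if rule_table.threshold is None:
        raise ValueError("the duration rule table must carry the threshold")
    if utterance_speed > rule_table.threshold:
        # Speed exceeds the threshold: use the empirically found durations.
        table = rule_table
    else:
        # Speed does not exceed the threshold: use the statistical predictions.
        table = prediction_table
    return table.durations_ms[phoneme]
```

With a rule table whose threshold is 10.0, a designated speed of 12.0 would read the duration from the rule table, while a speed of 10.0 or lower would read it from the prediction table.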

3. (currently amended) A method of controlling high-speed
reading in a text-to-speech conversion system including a text
analysis module for generating a phoneme and prosody character
string from an input text; a prosody generation module for
generating a synthesis parameter of at least a voice segment, a
phoneme duration, and a fundamental frequency for the phoneme and
prosody character string; a voice segment dictionary in which voice
segments as a source of voice are registered; and a speech
generation module for generating a synthetic waveform by waveform
superimposition while referring to said voice segment dictionary,
said method comprising the steps of:

providing said prosody generation module with a pitch contour
determination unit that has both a rule table empirically obtained
and a prediction table predicted by statistical analysis;

designating an utterance speed;

selecting one of the rule table and the prediction table
according to the utterance speed; and

determining a pitch contour by using accent and phrase
components contained in, when the utterance speed exceeds a
threshold contained in the rule table, said rule table and, when
said utterance speed does not exceed the threshold, said
prediction table.



4. (original) The method according to claim 3, wherein said 
threshold is a predetermined maximum utterance speed. 

5. (original) A method of controlling high-speed reading in a
text-to-speech conversion system including a text analysis module
for generating a phoneme and prosody character string from an input
text; a prosody generation module for generating a synthesis
parameter of at least a voice segment, a phoneme duration, and a
fundamental frequency for the phoneme and prosody character string;
a voice segment dictionary in which voice segments as a source of
voice are registered; and a speech generation module for generating
a synthetic waveform by waveform superimposition by referring to
said voice segment dictionary,

said method comprising the step of providing said prosody
generation module with a sound quality coefficient determination
unit that has a sound quality conversion coefficient table for
changing said voice segment to switch sound quality and selects
from said sound quality conversion coefficient table such a
coefficient that sound quality does not change when a user-
designated utterance speed exceeds a threshold.

6. (original) The method according to claim 5, wherein said
threshold is a predetermined maximum utterance speed.

7. (previously presented) A method of controlling high-speed
reading in a text-to-speech conversion system including a text
analysis module for generating a phoneme and prosody character
string from an input text; a prosody generation module for
generating a synthesis parameter of at least a voice segment,
phoneme duration, and fundamental frequency for the phoneme and
prosody character string; a voice segment dictionary in which voice
segments as a source of voice are registered; and a speech
generation module for generating a synthetic waveform by waveform
superimposition by referring to said voice segment dictionary,

said method comprising the step of providing said prosody
generation module with both a pitch contour correction unit for
outputting a pitch contour corrected according to an intonation
level designated by the user and a switch for determining whether a
base pitch is added to said pitch contour corrected according to
said user-designated utterance speed, said switch being controlled
not to change the base pitch when the utterance speed exceeds a
threshold.

8. (original) The method according to claim 7, wherein said
threshold is a predetermined maximum utterance speed.

9. (original) The method according to claim 7, wherein said
pitch contour correction unit performs a pitch contour generation
process that includes a phrase component calculation process in
which all phrases of an input sentence are processed by calculating
a phrase component by statistical analysis according to said user-
designated utterance speed or making said phrase component zero and
a process in which all words in said input sentence are processed
by calculating an accent component by statistical analysis
according to said user-designated utterance speed and either
correcting said accent component according to said user designated
intonation level or making said accent component zero.

10. (original) A method of controlling high-speed reading in a
text-to-speech conversion system including a text analysis module
for generating a phoneme and prosody character string from an input
text; a prosody generation module for generating a synthesis
parameter of at least a voice segment, a phoneme duration, and a
fundamental frequency for said phoneme and prosody character
string; a voice segment dictionary in which voice segments as a
source of voice are registered; and a speech generation module for
generating a synthetic waveform by waveform superimposition while
referring to said voice segment dictionary,

said method comprising the step of providing said speech
generation module with signal sound generation means for inserting
a signal sound between sentences to indicate an end of a sentence
when a user-designated utterance speed exceeds a threshold.

11. (original) The method according to claim 10, wherein said
threshold is a predetermined maximum utterance speed.
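
For illustration only, the signal sound insertion recited in claims 10 and 11 might be sketched as follows. This is a minimal sketch and not part of the claims: the function name `build_playback_sequence`, the `"<signal>"` placeholder standing in for the signal sound, and the use of plain strings in place of synthesized waveforms are all assumptions made for the example.

```python
from typing import List


def build_playback_sequence(sentences: List[str],
                            utterance_speed: float,
                            threshold: float,
                            signal: str = "<signal>") -> List[str]:
    """Return the playback order, adding a signal between sentences at high speed."""
    insert_signal = utterance_speed > threshold
    sequence: List[str] = []
    for index, sentence in enumerate(sentences):
        sequence.append(sentence)
        is_last = index == len(sentences) - 1
        # The signal marks a sentence end, so it goes between sentences only.
        if insert_signal and not is_last:
            sequence.append(signal)
    return sequence
```

At or below the threshold the sequence is returned unchanged, so normal-speed reading is not interrupted by the signal sound.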

12. (original) A method of controlling high-speed reading in a
text-to-speech conversion system including a text analysis module
for generating a phoneme and prosody character string from an input
text; a prosody generation module for generating a synthesis
parameter of at least a voice segment, a phoneme duration, and a
fundamental frequency for the phoneme and prosody character string;
a voice segment dictionary in which voice segments as a source of
voice are registered; and a speech generation module for generating
a synthetic waveform by waveform superimposition by referring to
said voice segment dictionary,

said method comprising the step of providing said prosody
generation module with a phoneme duration determination unit for
performing a process in which when a user-designated utterance
speed exceeds a threshold, an utterance speed of at least a leading
word in a sentence is returned to a normal utterance speed.

13. (original) The method according to claim 12, wherein said 
threshold is a predetermined maximum utterance speed. 

14. (original) The method according to claim 12, wherein said
phoneme duration determination unit performs a process in which 
when a word under process is a leading word in a sentence and said 
user-designated utterance speed exceeds said threshold, a phoneme 
duration is not corrected and, when said word under process is not 
a leading word of a sentence or said user-designated utterance 
speed does not exceed said threshold, a first process by which a 
phoneme duration correction coefficient is changed according to 
said user-designated utterance speed and a second process in which 
all syllables of said word are processed by correcting a length of 
a vowel or vowels of said word, and carrying out said first and 
second processes for all words contained in the sentence. 
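
For illustration only, the per-word processing recited in claims 12 and 14 might be sketched as follows. This is a minimal sketch and not part of the claims: the function name `correct_sentence`, the representation of a word as a list of vowel durations in milliseconds, the expression of utterance speed as a multiple of normal speed, and the coefficient formula `1.0 / utterance_speed` are all assumptions made for the example.

```python
from typing import List


def correct_sentence(words: List[List[float]],
                     utterance_speed: float,
                     threshold: float) -> List[List[float]]:
    """words[i][j] is the normal-speed vowel duration (ms) of syllable j of word i."""
    corrected: List[List[float]] = []
    for index, vowel_durations in enumerate(words):
        is_leading_word = index == 0
        if is_leading_word and utterance_speed > threshold:
            # Leading word at high speed: durations are left uncorrected,
            # so the word is read at the normal utterance speed.
            corrected.append(list(vowel_durations))
            continue
        # First process: derive a correction coefficient from the speed
        # (assumed formula; speed is a multiple of normal speed).
        coefficient = 1.0 / utterance_speed
        # Second process: correct the vowel length of every syllable.
        corrected.append([d * coefficient for d in vowel_durations])
    return corrected
```

With a threshold of 1.5 and a designated speed of 2.0, the first word keeps its normal durations while every later word is shortened by half.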

15. (new) A method of controlling high-speed reading in a 
text-to-speech conversion system, comprising: 

inputting a text into the text-to-speech conversion system; 

generating a phoneme and prosody character string of the text 
with a text analysis module; 

creating a duration rule table containing a first phoneme 
duration obtained empirically; 

creating a duration prediction table containing a second 
phoneme duration obtained through statistical analysis; 

designating an utterance speed; 

determining a threshold value; 

comparing the utterance speed with the threshold value; 

selecting one of the duration rule table and the duration 
prediction table according to the utterance speed; 

determining a third phoneme duration with a phoneme duration 
determination unit according to the one of the duration rule table 
and the duration prediction table; 

generating a synthesis parameter of at least a voice segment,
the third phoneme duration, and a fundamental frequency of the
phoneme and prosody character string with a prosody generation
module; and

generating a synthetic waveform through waveform
superimposition with a speech generation module according to the
synthesis parameter and a voice segment dictionary containing a
voice segment as a basic source of voice.

16. (new) The method according to claim 15, in the step of
selecting the one of the duration rule table and the duration
prediction table according to the utterance speed, said duration
rule table is selected when the utterance speed exceeds the
threshold value, and said duration prediction table is selected
when the utterance speed does not exceed the threshold value.

17. (new) The method according to claim 15, in the step of 
determining the threshold value, said threshold value is determined 
to be a predetermined maximum utterance speed. 

18. (new) A method of controlling high-speed reading in a 
text-to-speech conversion system, comprising: 

inputting a text into the text-to-speech conversion system;

generating a phoneme and prosody character string of the text
with a text analysis module;

creating a rule table containing first data of accent and
phrase components obtained empirically;

creating a prediction table containing second data of accent
and phrase components obtained through statistical analysis;

designating an utterance speed;

determining a threshold value;

comparing the utterance speed with the threshold value;

selecting one of the rule table and the prediction table
according to the utterance speed;

determining a pitch contour with a pitch contour determination
unit according to the one of the rule table and the prediction
table;

generating a synthesis parameter of at least a voice segment,
a phoneme duration, and a fundamental frequency of the phoneme and
prosody character string with a prosody generation module; and

generating a synthetic waveform through waveform
superimposition with a speech generation module according to the
synthesis parameter and a voice segment dictionary containing a
voice segment as a basic source of voice.

19. (new) The method according to claim 18, in the step of 
selecting the one of the rule table and the prediction table 
according to the utterance speed, said rule table is selected when 
the utterance speed exceeds the threshold value, and said 
prediction table is selected when the utterance speed does not 
exceed the threshold value. 

20. (new) The method according to claim 18, in the step of 
determining the threshold value, said threshold value is determined 
to be a predetermined maximum utterance speed. 



